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PASSIVE RELEASE AVOIDANCE TECHNIQUE 

CROSS-REFERENCE TO RELATED APPLICATIONS 

The present application claims priority from U.S. Provisional Patent Application 
Serial No. 60/229,830, which was filed on August 31, 2000, by Sam Duncan, Dave 
Golden, Darrel Donaldson, Dave Hartwell, Steve Ho, Andrej Kovec, Jeff Willcox and 
Roger Pannell for a SYMMETRICAL MULTIPROCESSOR COMPUTER SYSTEM 
and is hereby incorporated by reference in its entirety. 

BACKGROUND OF THE INVENTION 

Field of the Invention 

This invention relates to multiprocessor computer architectures and, more specifi- 
cally, to the handling and processing of interrupts by multiprocessor computer systems. 

Background Information 

Distributed shared memory computer systems, such as symmetrical multiproces- 
sor (SMP) systems, support high performance application processing. Conventional SMP 
systems include a plurality of processors coupled together by a bus. One characteristic of 
SMP systems is that memory space is typically shared among all of the processors. That 
is, each processor accesses programs and data in the shared memory, and processors 
communicate with each other via that memory (e.g., through messages and status infor- 
mation left in shared address spaces). In some SMP systems, the processors may also be 
able to exchange signals directly. One or more operating systems are typically stored in 
the shared memory. These operating systems control the distribution of processes or 
threads among the various processors. The operating system kernels may execute on any 
processor, and may even execute in parallel. By allowing many different processors to 
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execute different processes or threads simultaneously, the execution speed of a given ap- 
plication may be greatly increased. 

During operation, entities or "agents" of the computer system often generate inter- 
rupt signals in response to a need for service or the detection of an error by those entities 
or agents, which may include processors, memory controllers, input/output (I/O) devices, 
etc. For example, when an I/O device requests some particular type of service or action 
from a processor, it often signals the processor by issuing an interrupt. Specifically, the 
I/O device may assert an interrupt input pin of either of the processor or of an interrupt 
controller, which forwards the interrupt to the processor. Provided the interrupt is per- 
mitted, it will be acknowledged by the processor at the end of the current processor cycle. 
The processor then services the interrupt typically by branching to a special service rou- 
tine written to handle that particular interrupt. Upon servicing the interrupt, the processor 
signals the I/O device that it completed the. interrupt, typically with a write to the device, 
and the I/O device deasserts the interrupt signal. The processor then signals to the inter- 
rupt controller that the interrupt event has been serviced, typically with an end of inter- 
rupt signal. 

Multiprocessor, distributed memory systems may have many I/O devices gener- 
ating large numbers of interrupts. The Peripheral Component Interface (PCI) specifica- 
tion standard, for example, defines four level sensitive interrupts (INTA, INTB, INTC 
and INTD) that can be generated by a single, multifunction, I/O device. In many com- 
puter systems, the number of interrupts that can be supported is limited. As a result, a 
given interrupt signal, especially a level sensitive interrupt (LSI), may be shared by more 
than one I/O device. If two devices are asserting a given LSI, it will remain asserted until 
both devices have been serviced. 

The process of collecting interrupts and sending them to the processors for serv- 
icing can be complex. Inefficient processing of interrupts can result in reduced efficiency 
of the system. For example, in some implementations, it is possible for the end of inter- 
rupt to be detected prior to the I/O device deasserting the interrupt signal. This may 
cause the system to believe the interrupt signal has been asserted a second time even 
though there is no work to be done. Because interrupt signals can be shared, it is not suf- 
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ficient simply to wait for the interrupt signal to be deasserted as a second interrupt may 
be signaled immediately after the first and/or may overlap with the prior interrupt, re- 
sulting in a continuously asserted interrupt signal. The occurrence of interrupt signaling 
when no service is actually requested is known as "passive release", and reduces the effi- 
ciency of the computer system. However, Prior art computer systems were built to ex- 
pect and tolerate the passive release of interrupts, even though this would result in de- 
graded system performance. Accordingly, a need exists for efficiently collecting and 
processing interrupts in large, multiprocessor computer systems. 



Briefly, the present invention is directed to a system and method for avoiding pas- 
sive release of interrupts in a multiprocessor computer system. The computer system in- 
cludes a plurality of processors, a memory subsystem, a plurality of input/output (I/O) 
devices each capable of issuing interrupts, and an I/O bridge interfacing between the I/O 
devices and the processors. The I/O bridge includes a plurality of data ports for con- 
necting to the I/O devices themselves, and an interrupt port for collecting the interrupts 
generated by the I/O devices. The I/O devices initiate interrupts by asserting an interrupt 
signal, such as a level sensitive interrupt (LSI) wire or line that leads to the interrupt port. 

Upon detecting the assertion of an interrupt signal, the interrupt controller gener- 
ates a write message containing the interrupt and sends it to a designated processor for 
servicing the interrupt. The interrupt controller then sets an interrupt pending flag associ- 
ated with the interrupt. The designated processor services the interrupt and generates a 
first write message that is directed to the I/O device that initiated the interrupt. The proc- 
essor designates the first write message as an ordered transaction and sends it to the port 
of the I/O bridge to which the device is coupled. The processor also generates a second, 
ordered write message. The second write message directs the interrupt controller to clear 
the interrupt pending flag that was set in response to the interrupt from the I/O device. 
The processor also sends the second write message to the port to which the I/O device is 
coupled, after sending the first write message. 



SUMMARY OF THE INVENTION 
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The first and second write messages are received at the respective port of the I/O 
bridge, which includes an ordering engine that maintains the order of messages received 
from the processors. Since it was received first, the ordering engine forwards the first 
write message to the I/O device, which responds by deasserting the interrupt signal. Af- 
5 ter forwarding the first write message, the ordering engine then passes the second write 
message to the interrupt controller at the I/O bridge's interrupt port. The interrupt con- 
troller responds by clearing the interrupt pending flag. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The invention description below refers to the accompanying drawings, of which: 
Fig. 1 is a schematic block diagram of a symmetrical multiprocessor computer 
system in accordance with the present invention; 

Fig. 2 is a schematic block diagram of a dual processor module of the computer 
system of Fig. 1; 

Fig. 3 is a schematic block diagram of an I/O subsystem of the computer system 
of Fig. 1; 

Fig. 4 is a schematic block diagram of a data port of the I/O subsystem of Fig. 3; 
Fig. 5 is a highly schematic block diagram of an interrupt control system in ac- 
cordance with the present invention; and 

Figs. 6A-B are a flow diagram of the method of the present invention. 

DETAILED DESCRIPTION OF AN ILLUSTRATIVE 

EMBODIMENT 

Fig. 1 is a schematic block diagram of a symmetrical multiprocessor (SMP) sys- 
tem 100 comprising a plurality of processor modules 102 interconnected to form a two 
dimensional (2D) torus or mesh configuration. Each processor module 102 preferably 
25 comprises two central processing units (CPUs) or processors 104 and has connections for 
two input/output (I/O) ports (one for each processor 104) and six inter-processor (IP) 
network ports. The IP network ports are preferably referred to as North (N), South (S), 
East (E) and West (W) compass points and connect to two unidirectional links. The 
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North-South (NS) and East- West (EW) compass point connections create a (manhattan) 
grid, while the outside ends wrap-around and connect to each other, thereby forming the 
2D torus. The SMP system 100 further comprises a plurality of I/O subsystems 106. I/O 
traffic enters the processor modules 102 of the 2D torus via the I/O ports. Although only 
one I/O subsystem 106 is shown connected to each processor module 102, because each 
processor module 102 has two I/O ports, any given processor module 102 may be con- 
nected to two I/O subsystems 106 (i.e., each processor 104 may be connected to its own 
I/O subsystem 106). 

Fig. 2 is a schematic block diagram of a dual CPU (2P) module 102. As noted, 
each 2P module 102 preferably has two CPUs 104 each having connections 202 for the 
IP ("compass") network ports and an I/O port 204. The 2P module 102 also includes one 
or more power regulators 206, server management logic 208 and two memory subsys- 
tems 210 each coupled to a respective memory port (one for each CPU 104). The server 
management logic 208 cooperates with a server management system to control functions 
of the computer system 100 (Fig. 1), while the power regulators 206 control the flow of 
electrical power to the 2P module 102. Each of the N, S, E and W compass points along 
with the I/O and memory ports, moreover, preferably use clock-forwarding, i.e., for- 
warding clock signals with the data signals, to increase data transfer rates and reduce 
skew between the clock and data. 

Each CPU 104 of a 2P module 102 is preferably an "EV7" processor from Com- 
paq Computer Corp. of Houston, Texas, that includes part of an "EV6" processor as its 
core together with "wrapper" circuitry that comprises two memory controllers, an I/O 
interface and four network ports. In the illustrative embodiment, the EV7 address space 
is 44 physical address bits and supports up to 256 processors 104 and 256 I/O subsystems 
106. The EV6 core preferably incorporates a traditional reduced instruction set computer 
(RISC) load/store architecture. In the illustrative embodiment described herein, the EV6 
core is an Alpha® 21264 processor chip manufactured by Compaq Computer Corpora- 
tion of Houston, Texas, with the addition of a 1.75 megabyte (MB) 7-way associative in- 
ternal cache and "CBOX", the latter providing integrated cache controller functions to the 
EV7 processor. The EV7 processor also includes an "RBOX" that provides integrated 
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routing/networking control functions with respect to the compass points, and a "ZBOX" 
that provides integrated memory controller functions for controlling the memory subsys- 
tem 370. However, it will be apparent to those skilled in the art that other types of proc- 
essor chips may be advantageously used. 

Each memory subsystem 210 may be and/or may include one or more conven- 
tional or commercially available dynamic random access memory (DRAM), synchronous 
DRAM (SDRAM), double data rate SDRAM (DDR-SDRAM) or Rambus DRAM 
(RDRAM) memory devices. 

Each CPU 104 of the multiprocessor system 100 also includes a plurality of inter- 
nal control and status registers (CSRs). For example, each CPU has an internal interrupt 
queue register (not shown) for storing interrupts to be serviced by the processor. 

Fig. 3 is a schematic block diagram of an I/O subsystem 106. The subsystem 106 
includes an I/O bridge 302, which may be referred to as an "107", that constitutes a fun- 
damental building block of the I/O subsystem 106. The 107 302 is preferably imple- 
mented as an application specific integrated circuit (ASIC). As described above, each 
EV7 processor 104 includes one I/O port 204; however, there is no requirement that each 
processor 104 be connected to an I/O subsystem 106. 

The 107 302 comprises a North circuit region 304 that interfaces to the EV7 proc- 
essor 104 and a South circuit region 306 that includes a plurality of I/O data ports 308a-d 
(P0-P3) that preferably interface to standard I/O buses. An EV7 port 310 of the North 
region 304 couples to the EV7 processor 104 via two unidirectional, clock forwarded 
links 312. In the illustrative embodiment, three of the four I/O data ports 308a-c interface - 
to the well-known Peripheral Component Interface (PCI) and/or PCI-Extended (PCI-X) 
bus standards; while the fourth data port 304d interfaces to an Accelerated Graphics Port 
(AGP) bus standard. More specifically, ports P0-P2 include a PCI and/or PCI-X con- 
troller, such as controller 314 at port P0, which is coupled to and controls a respective 
PCI and/or PCI-X bus, such as bus 316. Attached to bus 316 may be one or more I/O 
controller cards, such as controllers 318, 320. Each I/O controller 318, 320, in turn, inter- 
faces to and is responsible for one or more I/O devices, such as I/O devices 322 and 324. 
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Port P3 may include an AGP controller (not shown) rather than a PCI or PCI-X controller 
for controlling an AGP bus. 

A cache coherent domain of the SMP system 100 preferably extends into the 107 
302 and, in particular, to I/O buffers or caches located within each I/O data port 308 of 
the 107 302. Specifically, the cache coherent domain extends to a write cache (WC) 326, 
a read cache (RC) 328 and a translation look-aside buffer (TLB) 330 located within each 
I/O data port 308. The WC 326 and RC 328, as well as TLB 466 function as coherent 
buffers. 

The South region 306 further includes an interrupt port 332 (P7). The interrupt 
port P7 collects PCI and/or AGP level sensitive interrupts (LSIs) and message signaled 
interrupts (MSIs) generated by I/O devices coupled to the other south ports P0-P3, and 
sends these interrupts to North region 306 for transmission to and servicing by the proc- 
essors 104 of the EV7 mesh. The interrupt port P7 also collects errors generated within 
the North and South regions 304, 306 of the 107 302. 

Fig. 4 is a schematic block diagram of port P0 in greater detail. Port P0 is cou- 
pled to a down hose 402 for receiving information from the EV7 mesh and an up hose 
404 for sending information into the EV7 mesh. Port P0 as well as ports P1-P3 and P7 
are coupled to the up hose 404 through a multiplexer (MUX) 406, which provides a sin- 
gle output to North region 304. Messages generated within any of the south ports P0-P3 
and P7 are received by and processed by the MUX 406 before transmission to North re- 
gion 304 (Fig. 3) and the EV7 mesh. The MUX 406 may include an up hose arbitration 
(arb) logic circuit.. . _ .. . 

Information received from the down hose 402 for port P0 is buffered at the read 
cache (RC) 328 which is configured to have a plurality of entries. Port P0 further in- 
cludes a down hose ordering engine 408 which is operatively coupled to the RC 328. 
The down hose ordering engine 408 maintains an index of the information contained in 
the RC 328, including whether that information corresponds to ordered or unordered 
transactions. Specifically, the down hose ordering engine 408 has an unordered queue 
410 and an ordered queue 412, which it uses to determine the order in which information 
in the RC 328 may be released to the intended recipient, e.g., an I/O device. For exam- 
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ple 5 upon receiving information from the down hose 402 and buffering that information 
in the RC 328, the down hose ordering engine sets a corresponding flag in the unordered 
queue 410 or in the ordered queues 412, depending on whether the received information 
corresponds to an unordered or an ordered transaction. 

If the information buffered at a particular entry of the RC 328 corresponds to an 
ordered transaction, then the down hose ordering engine 408 blocks the release of that 
information until all ordered transactions that were received earlier in time have been 
processed. Once all of the prior ordered transactions have been processed, the down hose 
ordering engine 408 releases, e.g., makes "visible", the subject information to the in- 
tended recipient. 

Fig. 5 is a schematic block diagram of a preferred interrupt control system 500 
which may be, at least in part, advantageously disposed at the interrupt port P7 of the 107 
302. The interrupt control system 500 includes an interrupt collector 502 having a plu- 
rality of interrupt wires or lines 504 for receiving interrupt signals, such as level sensitive 
interrupts (LSIs), from the I/O devices that may be coupled to the respective 107 302. 
Operatively coupled to the interrupt collector 502 is an interrupt controller 506. The in- 
terrupt controller 506 is preferably programmed to periodically poll the interrupt collector 
502 to determine which lines or wires 504 are asserted, and to retrieve that information, 
as indicated by arrow 508. The interrupt controller 506 also has access to a LSI register 
file 510. As described below, the interrupt controller 506 can read from and write to the 
LSI register file 510, as indicated by arrow 512. 

- - It should be understood that more than one I/O device may be connected to a 
given wire or line 504 leading into the interrupt collector 502. That is, interrupt lines 504 
can be shared among multiple devices and/or among different functions of multifunction 
devices. 

The interrupt controller 506 is also configured to generate interrupt messages or 
transactions, such as Write Internal Processor Register (WrIPR) messages or transactions, 
and send them to the MUX 406 (Fig. 4) for transmission, via the I07's North region 304, 
into the EV7 mesh, as indicated by arrow 514. The interrupt controller 506 can also re- 
ceive messages from the EV7 mesh as indicated by arrow 516. In addition, the interrupt 
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controller 506 may receive messages from the other ports P0-P3 of the 107, as indicated 
by arrow 518. 

The LSI register file 510 is preferably arranged, at least logically, as a table array 
having a plurality of columns and rows whose intersections define corresponding cells for 
storing information. In particular, the LSI register file 510 has a plurality of rows 520a-m 
corresponding to a wire or line 504 leading into the interrupt collector 502. The LSI reg- 
ister file 510 also has a plurality of columns for organizing information associated with 
each row 520a-m. Specifically, the LSI register file 510 has a first column 522 that con- 
tains a unique number for each row 520a-m. The interrupt number 522 may comprise 
three parts: a port ID 522a, a slot ID 522b and an interrupt designator (INTx) 522c. A 
second column 524 contains the identity of the target EV7 processor 104 that is to proc- 
ess the interrupt service routine of the interrupt identified by the respective row 520a-m. 
A third column 526 contains the thread of the EV7 processor 104 that is to process the 
interrupt service routine. A fourth column 528 indicates whether the interrupt of the re- 
spective row 520a-m is enabled or disabled. For example, the fourth column 528 may be 
1-bit. If the bit is asserted (e.g., set to "1") the respective interrupt is enabled. If the bit is 
de-asserted (e.g., set to "0"), the interrupt is disabled. A fifth column 530 indicates 
whether the respective interrupt is pending. Fifth column 530 may also be 1-bit and, if 
asserted, indicates that the interrupt is pending. If it is de-asserted, it indicates that the 
interrupt is not pending. 

The LSI register file 510 is preferably implemented as one or more registers dis- 
posed at the 107 302. These registers may be written to by the corresponding interrupt 
controller 506, by other I07s of the SMP system 100 and/or by one or more of the proc- 
essors 104. Nonetheless, those skilled in the art will recognize that the LSI register file 
510 may be implemented in other ways, such as through one or more memory devices, 
e.g., RAM. 

Figs. 6A and 6B is a flow diagram of the method of the present invention. Sup- 
pose a device, such as I/O device 324 (Fig. 3), at a given port of an I/O bridge, e.g., port 
P0 (Fig. 3) of the 107 302 requires some type of service from the EV7 mesh, as indicated 
at block 602. The I/O device 324 preferably signals a level sensitive interrupt (LSI) by 
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asserting the signal of its wire or line 504 (Fig. 5) leading to the interrupt collector 502 to 
the specified LSI level, as indicated at block 604. According to the PCI specification 
standard, LSIs are asserted low, i.e., negative true. Alternatively, I/O device 324 may 
issue a message signaled interrupt (MSI). During the next polling cycle, the interrupt 
controller 506 will "see" that a signal is present on a given line 504. This condition is 
detected by the interrupt controller 506, as indicated at block 606. For example, the lev- 
els of the wires or lines 504 may be captured by the interrupt collector 502 and moved to 
the interrupt controller 506 by one or more shift operations, utilizing one or more shift 
registers. 

Specifically, the interrupt collector 502 may include a parallel-load, serial shift 
register (not shown). The interrupt controller 506 periodically issues a signal to the col- 
lector 502 to load the register. The register is loaded in response to one or more interrupt 
lines 504 being asserted. The contents of the register are then serially shifted into the in- 
terrupt controller 506, i.e., into the 107 302. In this manner, the 107 302 can sample a 
large number of interrupt states, e.g., 32 per port P0-P3, with only a small number of 
pins, e.g., 3 pins. Each pin preferably supports 32-bits of interrupt state data. Each port 
P0-P3, moreover, preferably provides eight available slots and, under the PCI specifica- 
tion standard, each slot must be able to accept four interrupt signals, e.g., INTA, INTB, 
INTC and INTD. More than one cycle may be required to shift the contents of this reg- 
ister into the interrupt controller 506. Preferably, the interrupt controller determines 
which interrupts have been enabled based on the information at column 528 of the LSI 
register file 510, and only clocks the register a sufficient number of times to retrieve data 
for the enabled interrupts. 

In response to detecting the presence of an interrupt, the interrupt controller 506 
indexes to the appropriate entry or row, e.g., row 520i, of the LSI register file 510, which 
controls how the LSIs are processed. The controller 506 may first determine whether this 
particular interrupt is enabled by examining the corresponding entry from the interrupt 
enabled column 528 for row 520i. If the interrupt is not enabled, the interrupt controller 
506 stops. Assuming the interrupt is enabled, the interrupt controller 506 next examines 
the corresponding entry from the interrupt pending column 530 for entry 520i. This entry 
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should be "0", unless some other I/O device connected to this same wire or line 504 al- 
ready has an interrupt outstanding. 

The interrupt controller 506 then proceeds to post the interrupt from I/O device 
324 to the EV7 mesh, as indicated at block 608. Specifically, the interrupt controller 506 
generates a write message, such as a WrIPR message, that is directed to the interrupt 
queue register at the target EV7 processor 104 that is to service this interrupt. The inter- 
rupt controller 506 identifies the appropriate processor from the respective entry at the 
target EV7 column 524 for row 520i. Appended to the WrIPR is an interrupt identifier 
(IID), which the interrupt controller 506 generates by concatenating the following infor- 
mation: 



Interrupt Number EV7 Thread 



107 ID 



The interrupt number and EV7 thread are obtained from columns 522 and 524 of 
the LSI register file 510 for entry 520i. The interrupt controller 506 may be preconfig- 
ured with or otherwise have access to the ID of the 107. 

The interrupt controller 506 sends the message by passing it, via the MUX 406 
(Fig. 4), into the North region 304 from where it is sent via link 312 into the EV7 mesh. 
Within the EV7 mesh, the message is routed to the target processor 104 by means of the 
RBOX and the inter-processor (IP) network ports 202 of the various processors. Upon 
receiving the WrIPR message from the interrupt controller 506, the target processor 104 
stores the interrupt at its interrupt queue register, as indicated at block 610. The target 
processor 104 also issues an acknowledgment message, such as a Write Input/Output Ac- 
knowledgment (WrIOAck) message, to the interrupt controller 506 at the 107 302 to ac- 
knowledge its receipt of the interrupt, as also indicated at block 610. 

In response to the WrIOAck message, the interrupt controller 506 asserts the in- 
terrupt pending entry 530 for row 520i to reflect that the respective interrupt is pending 
service, as indicated at block 612. The target processor 104 then services, e.g., dis- 
patches, the interrupt, as indicated at block 614. To dispatch the interrupt, the target 
processor 104 preferably obtains the appropriate interrupt vector by using the IID as an 
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index, and executes the corresponding interrupt service routine. Upon completing the 
steps of the interrupt service routine, the target processor 104 generates an ordered mes- 
sage or transaction, such as an ordered, programmed 10 (PIO) message, informing the 
I/O device 324 that the processor has discharged the interrupt, as indicated at block 616. 
The target processor 104 then sends this ordered message to the I/O device 324 by di- 
recting it to port PO of the 107 302, as indicated at block 618. 

Within the SMP system 100, a message may be designated and thus recognized as 
either ordered or unordered by system agents, including the south ports of an 107 302, in 
several ways. For example, the SMP system 100 may be configured such that all mes- 
sages of a given type, e.g., all write messages, are processed in order. Alternatively, one 
or more fields and/or bits of a given message may be set in a predetermined manner, 
thereby designating the message as an ordered or an unordered message. 

In addition to the ordered PIO message, the target processor 104 generates a sec- 
ond ordered write message or transaction, such as an End of Interrupt (EOI) message, that 
directs the interrupt controller 506 at the 107 302 to clear the interrupt pending flag from 
column 530 for entry 520i, as indicated at block 620 (Fig. 6B). As described above, the 
interrupt pending flag is set in response to receiving a WrIOAck message from the target 
processor 104. This second write message has two significant characteristics. First, it is 
an ordered message that is sent only after the first write message is sent to the I/O device 
324, as indicated at block 622. Second, it is sent to the port, e.g., port P0, to which the 
I/O device 324 is coupled, and not to port P7 at which the interrupt controller 506 is lo- 
cated, as also indicated at block 622. The second write message is sent to port P0 even 
though it writes to a register disposed, at least logically, at port P7 and/or is intended to 
be read and acted upon by the interrupt controller 506 at port P7, rather than by some 
component disposed at port P0. 

The two ordered write message are received at the 107 302 in the same order in 
which they were sent via down hose 402, and, due to the way in which they were ad- 
dressed by the target processor 104, they are both are passed to port P0 for processing, as 
indicated at block 624. The messages are buffered in the read cache 328 and, since they 
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are both ordered messages, two respective flags are set in the ordered queue 412 to ensure 
that the first message is processed before the second. 

In particular, port PO transmits the first ordered message, i.e., the PIO message, to 
I/O device 324, as indicated at block 626. In particular, port PO hands the message to the 
PCI controller 314 which drives it onto the PCI bus 316 for receipt by device 324. De- 
vice 324 receives the PIO message which informs it that the EV7 mesh has serviced its 
interrupt. In response, device 324 deasserts its interrupt line 504, as indicated at block 
628. After sending the first ordered message, e.g., after driving the message onto the PCI 
bus 316, port PO sends the second ordered message buffered at the read cache 328 over to 
port P7, as indicated at block 630. That is, the ordering engine 408 recognizes the target 
of the second message as port P7, but must wait until all ordered transaction received 
earlier in time, including the PIO message, have been processed before sending the sec- 
ond message to port P7. As the 107 302 is preferably an ASIC, the second ordered mes- 
sage is transmitted to port P7 via an internal communication facility of the ASIC, such as 
a wire or line. 

The second message instructs the interrupt controller 506 to deassert the interrupt 
pending entry of column 530 for row 520i, as indicated by block 632, thereby reflecting 
that the respective interrupt has been dispatched. In the preferred embodiment, the inter- 
rupt controller 506 then waits a predetermined time, e.g., one or more clock cycles, be- 
fore again checking the status of the interrupt lines 504 leading to the interrupt collector 
502, as indicated at block 634. This ensures that the I/O device 324 has sufficient time to 
receive die first write message off of the PCI bus 316 and to deassert its interrupt line 
504. Accordingly, if the same interrupt line 504 is still asserted even after waiting the 
predetermined time, then the interrupt controller 506 knows that it is the result of some 
other interrupt, e.g., an interrupt issued by another device, e.g., I/O device 322, which 
shares this same interrupt line 504, or another request for service by the same device, i.e. 
I/O device 324. 

As shown, by issuing two ordered messages to the port to which the respective 
I/O device is coupled, the present invention ensures that the I/O device will deassert its 
interrupt signal before the EOI message is received and processed at the interrupt con- 
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troller 506. As a result, the passive release of interrupts is avoided. Thus, system re- 
sources are not spent attending to interrupts that have already been dispatched. 

It should be understood that the two write message may generated by the target 
processor 104 in any order but must be sent in the order described above. As indicated 
above, moreover, the second write message may clear the interrupt pending flag directly 
or it may be a command message directing the interrupt controller 506 to clear the flag. 
That is, the second message may be a write transaction targeting the respective 107 reg- 
ister at which the respective interrupt pending flag is stored. 

It should be further understood that the interrupt controller 506, after receiving the 
second write message, need not wait the predetermined time before polling the interrupt 
collector 502 again. 

It should also be understood that the SMP system 100 may take other forms be- 
sides a 2D torus. 

The foregoing description has been directed to specific embodiments of the pres- 
ent invention. It will be apparent, however, that other variations and modifications may 
be made to the described embodiments, with the attainment of some or all of their ad- 
vantages. For example, the present invention could be implemented in single processor 
computer systems. Therefore, it is the object of the appended claims to cover all such 
variations and modifications as come within the true spirit and scope of the invention. 

What is claimed is: 
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